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PRECONDITIONING TECHNIQUES IN FRAME THEORY AND 
PROBABILISTIC FRAMES 


KASSO A. OKOUDJOU 


Abstract. In this chapter we survey two topics that have recently been in¬ 
vestigated in frame theory. First, we give an overview of the class of scalable 
frames. These are (finite) frames with the property that each frame vector 
can be rescaled in such a way that the resulting frames are tight. This process 
can be thought of as a preconditioning method for finite frames. In particular, 
we: (1) describe the class of scalable frames; (2) formulate various equivalent 
characterizations of scalable frames, and relate the scalability problem to the 
Fritz John ellipsoid theorem. Next, we discuss some results on a probabilistic 
interpretation of frames. In this setting, we: (4) define probabilistic frames as 
a generalization of frames and as a subclass of continuous frames; (5) review 
the properties of certain potential functions whose minimizers are frames with 
certain optimality properties. 
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1. Introduction 

This chapter is devoted to two topics that have been recently investigated within 
frame theory: (a) frame preconditioning methods investigated under the vocable 
scalable frames, and (b) probabilistic methods in frame theory referred to as prob¬ 
abilistic frames. Before getting to the details on each of these topics we briefly 
review some essential facts on frame theory and refer to [m sa Eoi EH for more 
on frames and their applications. In all that follows we restrict ourselves to (finite) 
frames in . 

1.1. Review on finite frame theory. 

Definition 1.1. A set is a frame for if3A,B > 0 such 

that \/x G 

M 

MxV < < S||xf. 

fc=i 

If, in addition, each is unit-norm, we say that $ is a unit-norm frame. The set 
of frames for with M elements will be denoted by and simply if if 

M and N are fixed. In addition, we shall denote the subset of unit-norm frames by 
TuiM,N), i.e., 

N) := {{ipkffii e TiM, N) : ||^,.||2 = 1 for fc = 1,..., M} . 

We shall investigate frames via the analysis operator, L, defined by 

L : —)• Lx = {{x, 

The synthesis operator is the adjoint L* of L and is defined by 

M 

L* : ^ : c = (cfe)^i ^ L*c = ^ Ck^Pu- 

k=l 

It is easily seen that the canonical matrix associated to L* is the N x M matrix 
whose column is the frame vector pk- As such we shall abuse notation and 
denote this matrix by $ again. Consequently, the canonical matrix associated with 
L is simply the transpose of $. 

The frame operator S = L*L is given by 

M 

S : —)■ : X I—)■ Sx = '^^{x, pk)pk, 

k=l 

and its matrix will be denoted (again) by S with 

S = 

The Gramian (operator) of the frame is defined by 

G = LL* = 

In fact, the Gramian in an M x M matrix whose {i,))*^ entry is {pj,pi). 

$ = {pk}^=i C is a frame if and only if S' is a positive definite matrix on 
In this case. 


= {s-\k}^=. 
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is also a frame, called the canonical dual frame, and, for each x G 

M M 

( 1 - 1 ) X = y^fx, (pk)f’k=ifk)‘Pk- 


, we have 


/c=l 


A frame $ is a tight frame if we can choose A = B. In this case the frame operator 
is simply a multiple of the identity operator. To any frame $ = C is 

associated a canonical tight frame given by 


WUtU = C 


such that for every x G 


( 1 . 2 ) 


M 




k' 


If $ is a tight frame of unit-norm vectors, we say that $ is a finite unit-norm 


tight frame (FUNTF). In this case, the reconstruction formula (1.1) reduces to 


(1.3) 


Vx ^ ' 


M 

k=l 


FUNTFs are one of the most fundamental objects in frame theory with several 
applications. Other chapters in this volume will delve more into FUNTFs. The 


reconstruction formulas (1.3) and (1.2) are very reminiscent of the expansion of a 
vector in an orthonormal basis for MA However, due to the redundancy of frames, 
the coefficients in these reconstruction formulas are not unique. But the “simplic¬ 
ity” of these reconstruction formulas makes the use of tight frames very attractive 
in many applications. This in turn, spurs the need for methods to characterize and 
construct tight frames. 

A major development in this direction is due to Benedetto and Fickus [7] who 
proved that for each $ = {g>k}kLi C such that ||<pfe|j = 1 for each k, we have 


(1.4) 


M M 


FP($) 


j=i k=i 


where FP($) is the frame potential. The bound given in (1.4) is the global minimum 


of FP and is achieved by orthonormal systems if M < A^, and by tight frames 
if M > N. Casazza, Fickus, Kovacevic, Leon and Tremain m extended this 
result by removing the condition that the frames are unit norm. In essence, these 
results suggest that one may effectively search for FUNTFs by minimizing the frame 
potential. In practice, techniques such as the steepest-descent method can be used 
to find the minimizers of the frame potential [niisi]. For other related results on 
the frame potential we refer to [33l 0^ , and na Chapter 10]. The frame potential 
and some of its generalization will be considered in Section 

Construction of FUNTFs has seen a lot of research activities in recent years and 
as a result a number of construction methods have been offered. Casazza, Fickus, 
Mixon, Yang and Zhou [18j introduced the spectral tetrix method for constructing 
FUNTFs. This method has been extended to construct all tight frames, [BH IMlIKH] . 
There have also been other new insights in the construction of tight frames, leading 
to methods rooted in differential and algebraic geometry, see [niiiEniin]- Some 
of these methods will be introduced in some of the other chapters of this volume. 
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1.2. Scalable frames. While these powerful algebro-geometric methods can con¬ 
struct all tight frames with given vector lengths and frame constant, they have not 
been able to incorporate any extra requirement. To put it simply, constructing 
application-specific FUNTFs involves extra construction constraints, which usually 
makes the problem very difficult. However, one could ask if a (non) tight frame 
can be (algorithmically) transformed into a tight one. An analogous problem has 
been investigated for decades in numerical linear algebra. Indeed, precondition¬ 
ing methods are routinely used to convert large and poorly-conditioned systems of 
linear equations Ax = b, into better conditioned ones liisaiia. For example, a 
matrix A is (row/column) scalable if there exit diagonal matrices Di,D 2 with pos¬ 
itive diagonal entries such that DiA, AD 2 , or D 1 AD 2 have constant row/column 
sum, [1131 iZl Hi ET]. Matrix scaling is part of general preconditioning schemes 
in numerical linear algebra His]- 

One of the goals of these notes is to survey recent developments in precondition¬ 
ing methods in finite frame theory. In particular, we describe recent developments 
in answering the following question: 

Question 1.2. Is it possible to (algorithmically) transform a (non-tight) frame into 
a tight frame? 

In Section we outline a convex geometric approach that has been proposed 
to answer this question. For example, we consider the case of solving Question |1.2| 
using some classes of scaling matrices to transform a non-tight frame into a tight 
one. More specifically, we give an overview of recent results addressing the following 
problem. 

Question 1.3. Is it possible to rescale the norms of the vectors in a (non-tight) 
frame to obtain a tight frame? 

Frames that answer positively Question [L^ are termed scalable frames, and were 
characterized in [S3] , see also |53| . This characterization is operator-theoretical and 
solved the problem in both the finite and the infinite dimensional settings. More 
precisely, in the finite dimensional setting, the main result of |52j characterizes the 
set of non scalable frames and gives a simple geometric condition for a frame to 
be non scalable in dimensions 2 and 3, see Section [2^ Other characterizations of 
scalable frames using the properties of the so-called diagram vector (mi) appeared 
in (33] . We refer to m for some other results about scalable frames. 

1.3. Probabilistic frames. While frames are intrinsically defined through their 
spanning properties, in real euclidean spaces, they can also be viewed as distri¬ 
butions of point masses. In this context, the notion of probabilistic frames was 
introduced as a class of probability measures with finite second moment and whose 
support spans the entire space 1331301 ED- Probabilistic frames are special cases of 
continuous frames as introduced by S. T. Ali, J.-P. Antoine, and P.-P. Gazeau [T|, 
see also [35] . 

In Section [^ we consider frames from this probabilistic point of view. To begin 
we note that probabilistic frames are extensions of the notion of frames previously 
defined. Indeed, consider a frame, $ = for and define the probability 

measure 
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where 5x is the Dirac mass at x G It is easily seen that the second moment of 
is finite, i.e., 

M 

\\x\\^dfics>{x) = if XI 

fc=i 

and the span of the support of is Thus, each (finite) frame can be associated 
to a probabilistic frame. We shall present other examples of probabilistic frames 
associated to <i> in Section By analogy to the theory of finite frame, we shall 
say that a probability measure on with finite second moment is a probabilistic 
frame if the linear span of its support is . 

It is known that the space 7^2 of probability measures with finite second 
moments can be equipped with the (2-)Wasserstein metric. In this setting, many 
questions in frame theory can be seen as analysis problems on a subset of the 
Wasserstein metric space 7^2 (R^)- In Section we introduce this metric space 
and derive some immediate properties of frames in this setting. Moreover, this 
probabilistic setting allows one to use non-algebraic tools to investigate questions 
from frame theory. For instance, tools like convolution of measures has been used 
in [32] to build new (probabilistic) frames from old ones. 

One of the main advantages in analyzing frames in the context of the Wasserstein 
metric lies in the powerful tools available to solve some optimization problems 
involving frames using the framework of optimal transport. While we will not 
delve into any details in this chapter, we point out that optimization of functionals 
like the frame potential can be studied in this setting. For example, C. Wickman 
recently showed that a potential function that generalizes Benedetto and Fickus’s 
frame potential can be minimized in the Wasserstein space using some optimal 
transport notions [82]. In the last part of the lecture, we shall focus on a family of 
potentials that generalize the frame potential and present a survey of recent results 
involving their minimization. In particular, this family includes the coherence of a 
set of vectors, which is important quantity in compressed sensing [niEsiisi] , as 
well as a functional whose minimizers are, in some cases, solutions to the Zauner’s 
conjecture [H31IM] . 

Though we shall not elaborate on these here, it is worth mentioning that prob¬ 
abilistic frames are related to many other areas including: (a) the covariance of 
matrices multivariate random vectors [ST] US] [TT] [75] ; (b) directional statistics 
where there are used to test whether certain data are uniformly distributed; see, 
EUEH]; (c) isotropic measures [371 m], which, as we shall show, are related to 
the class of tight probabilistic frames. We refer to |32j for an overview of other 
relationships between probabilistic frames and other areas. 

The rest of this chapter is organized as follows. In Section [^ we give an overview 
of the recent developments on scalable frames or preconditioning of frames. In 
Section [^ we deal with probabilistic frames and outline the recently introduced 
Wasserstein metric tools to deal with frame theory. 

2. Preconditioning techniques in frame theory 

Scalable frames are frames $ = {^Pk}^=i for for which there exist nonnegative 
scalars {ck\^L^ C [0,oo) such that 



$ = {Ck^k}k=l 
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is a tight frame. Scalable frames were first introduced and characterized in [52]. 
Both infinite and finite dimensional settings were considered. In this section, we 
only focus on the latter giving an overview of recent methods developed to un¬ 
derstand scalable frames. The results that we shall describe give an exact char¬ 
acterization of the set of scalable frames from various perspectives. However, the 
important and very practical question of developing algorithms to find the weights 
{cfc} that make the frame tight will not be considered here, and we refer to |53| for 
a sample of results on this topic. Similarly, when a frame fails to be scalable, one 
could seek to relax the tightness condition and seek an “almost scalable frame”. 
These considerations are sources of ongoing research and will not be taken upon 
here. Finally, it is worth pointing out a very interesting application of the theory of 
scalable frames to wavelets constructed from the Laplacian Pyramid scheme [44] . 

The rest of this section is organized as follows. In Section [O] we define scalable 
frames and derive some of their elementary properties. We then outline a char¬ 
acterization of the set of scalable frames in terms of certain convex polytopes in 
Section 2.2 This characterization is preceded by motivating examples of scalable 


frames in dimension 2. In Sections |2.3| and |2.4| we give two other equivalent char¬ 
acterizations of scalable frames. The first of these characterizations has geometric 
interpretation, while the second one is based on Fritz John’s ellipsoid theorem. 


2.1. Scalable frames: Definition and properties. The following definitions of 
scalable frames first appeared in [^ |S3| : 

Definition 2.1. Let M,N be integers such that N < M. A frame $ = {<Pk}^=i 
in is called scalable, respectively, strictly scalable, if there exist nonnegative, 
respectively, positive, scalars {ck}^,^i such that {cfe</5fc}^i is a tight frame for 
The set of scalable frames, respectively, strictly scalable frames, is denoted by 
SC{M,N), respectively, SC+{M, N). 

Moreover, given an integer m with N < m < M, $ = {>fk}k=i 
m-scalable, respectively, strictly m—scalable, if there exist a subset $/ = {ipk}kei 
with / C {1,2,... ,M}, = m, such that 4)/ = {(pk}kGi is scalable, respectively, 

strictly scalable. 

We denote the set of m-scalable frames, respectively, strictly m-scalable frames 
in J-{M, N) by SC{M, N, m), respectively, SC+{M, N, m). 

When the integer m is fixed in a given context, we will simply refer to an 
m—scalable frame as a scalable frame. The role of the parameter m is especially 
relevant when dealing with frames of very large redundancy, i.e., when M/N ^ 1. 
In such a case, choosing a “reasonable” m such that the frame is m—scalable could 
potentially lead to sparse representations for signals in . In addition, the prob¬ 
lems of finding the weights that make a frame scalable as well as determining the 
smallest m such that a given frame is m— scalable have been considered in pTlIldj . 
We shall give more details about this question in Section |2.2[ 

We now point out some special and trivial examples of scalable frames. When 
M = N, a frame $ is scalable if and only if $ is an orthogonal set. Moreover, when 
M > N, a ^ contains an orthogonal basis, then it is clearly A^—scalable. Thus, 
given M > N, the set SC{M, N, N) consists exactly of frames that contains an 
orthogonal basis for 

So from now on we shall assume without loss of generality that M > N + 1, that 
$ contains no orthogonal basis, and that (pk for £ ^ k. 
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Given a frame $ C assume that $ = U $2 where 
$i={^We$:y,W(iV)>0} 

and 

$2 = G < 0}. 

In other words, $1 consists of all frame vectors from $ whose coordinates are 
nonnegative. Then the frame $' = $1 U (—$ 2 ) = U {—has the same 

frame operator as $. In particular, $ is a tight frame if and only if $' is a tight 
frame. In addition, $ is scalable if and only if $' is scalable with exactly the same 
set of weights. Note that the frame vectors in $' are all in the upper-half space. 
Thus, when convenient we shall assume without loss of generality that all the frame 
vectors are in the upper-half space, that is $ C x 1R+ where IR+ = [0,oo). 

We note that a frame $ = C with (pk 0 for each fc = I,..., M is 

scalable if and only if <!)' = is scalable. Consequently, we might assume 

in the sequel that we work with frames consisting of unit norm vectors. 

We now collect a number of elementary properties of the set of scalable frames 
in We refer to [521 [53] for details. 

proposition 2.2. Let M > N, and m > I be integers. 

(i) If $ G is m-scalable then m> N. 

(ii) For any integers m, m' such that N < m < m' < M we have that 

SC{M, N, to) C SC{M, N, to'), 


SC(M,N) = SC{M,N,M) = jj SC{M,N,Tn). 

m—N 

(hi) <I> G SC{M, N) if and only if T(<I>) G SC{M, N) for one (and hence for all) 
orthogonal transformation(s) T on 

(iv) Let $ = {pk}k=i G .F(7V -|- I,A^) \ {0} with ^ for k ^ If 
$ G SC+iN + 1,N,N), then $ ^ SC+{N+1,N). 

remark 2.3. We point out that part (hi) of Proposition [272] is equivalent to saying 
that $ is not scalable if one can hnd an orthogonal transformation T on such 
that r($) is not scalable. 


Besides these elementary properties, a study of the topological properties of the 
set of scalable frames was considered in [521 US] ■ In particular. 


proposition 2.4. Let M > N > 2. 

(i) SC{M,N) is closed in iF{M,N). Furthermore, for each N < m < M, 
SC{M, N,m) is closed in iF{M,N). 

(ii) If M < N{N + I)/2, then the interior of SC{M,N) is empty. 


Part (i) of proposition 2.4 was proved in [S21 Corrollary 3.3] and [S31 Proposition 
4.1], while part (ii) first appeared in [S31 Theorem 4.2]. 
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2.2. Convex polytopes associated to scalable frames. We now proceed to 
write an explicit formulation of the scalability problem. From this formulation a 
convex geometric characterization of SC{M, N) will follow. To start, we recall that 
$ denote the synthesis operator associated to the frame $ = $ is (m-) 

scalable if and only if there are positive numbers {xk}kei with = m > N such 
that $ = satisfies 

(2.1) = AIm = 

where X is the diagonal matrix with the weights Xk on its main diagonal if fc S / 
and 0 for k ^ and /at is the N x N identity matrix. Moreover, 


l!-’^l!o = #{fc ■■ Xk > 0} = rn> N. 


By rescaling the diagonal matrix X, we can assume that A 
equivalent to solving 


1. Thus, (2.11 is 


( 2 . 2 ) = In 

for Y = Ax^. 

A .. . 

To gain some intuition let us consider the two dimensional case with M > 3. In 
particular, let us describe when d) = C is a scalable frame. Without 

loss of generality, we may assume that $ = {‘Pk}kLi C K x M+, ||(/3fc|| = 1, and 
(/?£ ^ ±(pfc ioT I ^k. Thus 


= 


/ cos Qk 
Vsin Qk 


e 


with 


0 = 


Let Y = (yk)¥-. C [0, 


9i < 62 < O 3 < ... < 9m < 


TT. 




(2.3) 


Lfcli yfc cos^ 9k 
Y^kLiUk sin dk cos 9k 


YlkLiVk sin 9k cos 9k\ 
Vk sin^ 9k j 


This is equivalent to 



[ Y.k=l Vk COS^ dfc = 1 

\ ^ Vk sin^ 9k = I 

[ Y.^=iykAm9kCOs9k = 0 , 

and using some row operations we arrive at 


[ Efci2/fcSin^^fc = 1 
{ yfc cos 26»fc = 0 

[ I]fci2/fcSin26»fc = 0. 


For $ to be scalable we must find a nonnegative vector Y = {yk)k^i in the kernel 
of the matrix whose k*^ column is fI . Notice that the first equation is iust 

Vsm26»fc/ ^ •' 


a normalization condition. 

We now describe the the subset of the kernel of this matrix that consists of 
non-trivial nonnegative vectors. Observe that the matrix can be reduced to 
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(2.4) 


1 cos 202 
0 sin 202 


cos 29 m 
sin 29 m 


Example 2.5. We first consider the case M = 3 . In this case, we have 0 = 0i < 


92 < 93 < TT, and the (2.4) becomes 


, . /l cos 202 cos 203 \ 

^ > 1^0 sin 202 sin 203 y ' 

If there exists an index fcp G {2,3} with sin20fc|j = 0, then 9kg = 7r/2 and the 
corresponding frame contains an ONB and, hence is scalable. 

(i) Moreover, if fcg = 2, then 0 = 0i < 02 = 7r/2 < 03 < tt. In this case, the 
fame is 2 — scalable but not 3— scalable, i.e., the frame is in iSC+(3, 2, 2) \ 
5C(3, 2, 3). This is illustrated by Figure 

(ii) If ko = 3, then 0 = 0i < 02 < 03 = tt/2. By symmetry (with respect to the 
y axis) we conclude again that the fame is 2— scalable but not 3— scalable. 



Figure 1. A scalable frame (contains an orthonormal basis) with 
3 vectors in The original frame vectors are in blue, the frame 
vectors obtained by scaling are in red, and for comparison the 
associated canonical tight frame vectors are in green. 


Assume now that 9 k 7r/2 for k = 2 , 3 . If 03 < tt / 2 , then the frame cannot be 
scalable. Indeed, u= {zi,Z 2 ,Z 3 ) belongs to the kernel of (2.51 if and only if 


( 2 . 6 ) 


Zl 

Z2 


sin2(6>3-6>2) 

sin 202 

sin 203 

sin 202 3, 


where 2:3 G K. The choice of the angles implies that Z 2 Z 3 < 0 and Z 1 Z 3 < 0 
with equality if and only if 23 = 0. This is illustrated by Figure Similarly, if 
0 = 01 < 7r/2 < 02 < 03 < TT, then the frame cannot be scalable. 

On the other hand if 0 = 0i < 02 < 7r/2 < 03 < tt, then it follows from (2.6) 
2i > 0 and 22 > 0 for all 23 > 0 if and only if 03 — 02 < tt / 2 . Consequently, 
when 0 = 01 < 02 < tt /2 < 03 < tt the frame 4) G 5C+(3,2,3) if and only if 
0 < 03 — 02 < tt / 2 . This is illustrated by Figure]^ 
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Figure 2. A non scalable frame with 3 vectors in The original 
frame vectors are in bine, for comparison the associated canonical 
tight frame vectors are in green. 



Figure 3. A scalable frame with 3 vectors in The original 
frame vectors are in blue, the frame vectors obtained by scaling 
are in red, and for comparison the associated canonical tight frame 
vectors are in green. 

Example 2.6. Assume now that M = 4. Then we are lead to seek nonnegative 
non-trivial vectors in the null space of 



If there exists an index fco G {2,3,4} with sin20fcg = 0, then 9^^ = 7 r /2 and 
the corresponding frame contains an ONB. Consequently, the frame is scalable. In 
particular, 

(1) When ko = 2, the null space of the matrix is described by 



where Z 2 ,Z 4 € M. Note that Z 3 < 0, with equality only when Z 4 = 0, in 
which case Z 3 = 0 and the frame will be 2 —scalable, but not m—scalable 
for m = 3,4. This is illustrated by the left figure in Figure]^ 
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Figure 4. A scalable frame (contains an orthonormal basis) with 
with 4 vectors in The original frame vectors are in blue, the 
frame vectors obtained by scaling are in red, and for comparison 
the associated canonical tight frame vectors are in green. 


(2) If instead, fco = 3, then a similar argument shows that 


2:1 

2:2 


2:3 H- 

_ sin 2^4 

sin 202 


2 (^ 4 —^ 2 ) 

sin 2^2 

Z4, 


2^4, 


where 23 , 2:4 S K. Any choice of 24 > 0 will result in 22 > 0. If we choose 
04 — 02 < 7 r/ 2 , then 23 > 0 will lead to a 3— scalable frame or a 4—scalable 
frame. If instead, 04 — 02 > 7 r /2 we can always choose 23 > 0 and large 
enough to guarantee that 21 > 0, hence $ will be 4—scalable. 

(3) When ko = 4, then <I> G 5C+(4, 2, 2) \ 5C(4, 2, m) for m = 3,4. 

(4) When sin 20^ ^ 0 for k G {2, 3,4} then 


Sin2(e3-e2) I Sin2(e4-e2)„ 

sin 202 ^3 -r sin26»2 ^4, 

sin 203 _ sin 204 ^ 

sin 202 sin 202 

where 23,24 G ffi. A choice of 23,24 > 0 will lead to a scalable frame if at 
least 2 i > 0 or 22 > 0. For example. Figure]^ shows a scalable frame. 




Figure 5. A scalable frame with with 4 vectors in K^. The origi¬ 
nal frame vectors are in blue, the frame vectors obtained by scaling 
are in red, and for comparison the associated canonical tight frame 
vectors are in green. 

But in this case we could also get non scalable frame, see Figure . 
The implication here is that the scalability of the frames depends on the 
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relative position of the frame vectors (hence) the angles dk- This will be 
made rigorous in Section [2731 



Figure 6 . A non scalable frame with 4 vectors in The original 
frame vectors are in blue, for comparison the associated canonical 
tight frame vectors are in green. 


More generally in this two dimensional case, we can continue this analysis of 
the transformation given by the matrix (2.4) to characterize when $ = 


is scalable. From the figures shown in Examples |2.5| and |2.6[ it is clear that some 
geometric considerations are involved. Before elaborating more on these geometric 
considerations in Section [57^ we now consider the general case M > N > 2. In 
particular, we follow the algebraic approach given in the two-dimensional case, by 
writing out the equations in (2.2) and collecting all the diagonal terms one the 
one hand, and the non-diagonal terms on the other, we see that for a frame to be 
m-scalable it is necessary and sufficient that there exists u = (cf, c|,..., with 
Iloilo := : Uk > 0} < m which is a solution of the following linear system of 

equations in the M unknowns 


(2.7) 


M 

M 

i=i 


1 for k = 1,... ,N, 

0 foT k > £ = I,..., N. 


We can further reduce this linear system in the following manner. We keep all 
the equations with homogeneous right-hand sides, i.e., those coming from the non 
diagonal terms of (2.2). There are N{N — l)/2 such equations. The remaining N 
equations come from the diagonal terms of (2.2), and their right hand-sides are all 
1. We can use row operations to reduce these to a new set of N linear equations 
the first of which will be 

M 


= 1 - 
i=i 


For k = 2,... ,N, the equation is obtained by subtracting row 1 from row k 
leading to 

M 

- ‘Pj¥?)yj = 0 - 
i=i 
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The condition 

M 

= 1 

is a normalization condition, indicating that if <i> can be scaled with y = {yj)fLi C 
[0, oo), then it can also be scaled by Xy for any A > 0. Thus, ignoring this condition 
and collecting all the remaining equations, we see that <i> is m-scalable if and only 
if there exists a nonnegative vector u G with ||m||o < rn such that 

F($)u = 0, 

where the {N — 1){N + 2)/2 x M matrix F{^) is given by 

F{<^) = {F{ipi) Fiip2) ... F{ipM)) , 
where F : —)■ d := {N — 1){N + 2)/2, is defined by 



( \ 


f xl-xl\ 



(2.8) F{x) = 

Fi{x) 

, Fq{x) = 


, Fk{x) = 

^/c^/c+2 


\.Fn-i{x)/ 


yxf - 


\ XkXN / 


and Fo{x) G Fk{x) G fc = 1, 2,..., - 1. 

To summarize, we arrive at the following result that was proved in [531 Proposi¬ 
tion 3.7] 

proposition 2.7. [SSI Proposition 3.7] A frame $ for is m-scalable, respectively, 
strictly m-scalable, if and only if there exists a nonnegative u G ker A($) \ {0} with 
||u||o ^ iTT-j respectively, UmUo = w, where HmJIo = : Rfe > 0}. 

In the two dimensional case the map F reduces to 



However, in all the previous examples we considered instead the more geometric 
map : 



It is readily seen that A($) and F{^) have exactly the same kernel. In fact the 
map F carries the following geometric interpretation. Let Lg be a line through the 
origin in which makes an angle 9 with the positive a;—axis with 9 G [0, tt]. Then 
the image of Lg by F is the line L2g that makes an angle 29 with the positive x— 
axis. That is, F just rotates counterclockwise the line Lg by an angle equal to 9. 

In the two dimensional case we exploited the geometric meaning of the map F 
or F to describe the subset of nonnegative vectors of the nullspace of F{^). More 
generally, to find nonnegative vectors in the nullspace of the matrix T'($) we can 
appeal to one of the formulation of Parkas lemma: 

Lemma 2.8. [601 Lemma 1.2.5] For every real N x M-matrix A exactly one of the 
following cases occurs: 

(i) The system of linear equations Ax = 0 has a nontrivial nonnegative solution 
X G i.e., all components of x are nonnegative and at least one of them 
is strictly positive. 
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(ii) There exists y G such that A is a vector with all entries strictly 
positive. 

Applying this in the two dimensional case, we see that for the frame to be 
scalable, the second alternative in Farkas’s lemma should not hold. That is there 
must exist no vector in that lies on “one side” of all the vectors F{ipk) for 
fc = 1, 2,..., M. We illustrate this by the following hgures: 



y' 

/ 

! 

I 

\ 

\ 

\ 



\ 

f 

I 


/ 


/ 



Figure 7. Frames with 4 vectors in (in blue, top and bottom 
left) and their images by the map F (in green, top and bottom 
right). Both of these examples result in non scalable frames. 

We make the following observation about the first alternative of Lemma |2.8[ If 
C represents the column vectors of A, then there exists of a vector 
0 ^ X = {xk)^^i with Xk> 0 such that Ax = 0 is equivalent to saying that 

M 

^ ^ XkAk — 0 . 
fc=i 

Without loss of generality we may assume that = 1, in which case the 

condition is equivalent to 0 being a convex combination of the column vectors of A. 
Thus, having a nontrivial nonnegative vector in the null space of A is a statement 
about the convex hull of the columns of A. 

Motivated by the geometric intuition we gained from the two-dimensional setting 
and to effectively use Farkas’s lemma, we introduce a few notions from convex 
geometry, especially the theory of convex polytopes, and we refer to [551IHO] for 
more details on these concepts. For a finite set X = C K'^, the polytope 

generated by X is the convex hull of X, which is a compact convex subset of . 
In particular, we denote this set by Px (or co(X)), and we have 

{ M M 

ctkXk : Ofc > 0, = I 

fc=l k=l 
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Figure 8 . Frames with 4 vectors in (in blue, top and bottom 
left) and their images by the map F (in green, top and bottom 
right). Both of these examples result in scalable frames. 


The affine hull generated by X is defined by 

{ M M 

^akXk : y^Qffc = 1 

k=l k=l 

We have co(X) C aS{X). The relative interior of the polytope co(X) denoted by 
ri co{X), is the interior of co(X) in the topology induced by aS{X). We have that 
ri co{X) 7 ^ 0 as long as > 2, and 


{ M M 

ctkXk : gfc > = 1 

k=l fc=l 

The polyhedral cone generated by X is the closed convex cone C{X) defined by 

{ M 

CtkXk ■■ ak>0 

i=k 

The polar cone of C{X) is the closed convex cone C°{X) defined by 

C°{X) := : {x,y) < 0 ior all y G C{X)}. 

The cone C{X) is said to be pointed if C{X) n {—C{X)) = {0}, and blunt if the 
linear space generated by C{X) is , i.e., spanC'(X) = R^. 

Using Proposition |2.7[ we see that $ is (m—)scalable if there exists {Xk}kei C 
[0, oo) , #/ = m such that 

y] AfcF(v?fc) = 0. 

kGl 

This is equivalent to saying that 0 belongs to the polyhedral cone generated by 
F(4>/) = {F(i^fc)}fcg/. Without loss of generality we can assume that Afc = 1 
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which implies that 0 belongs to the polytope generated by F{^j) = {F{(fk)}kGi- 
Putting these observations together with Lemma |2.8| the following results were 
established in theorem 3.9]. In the sequel, we shall denote by [K] the set of 
integers {1,2,..., K} where K gN. 

theorem 2.9. [53l Theorem 3.9] Let M > N > 2, and let m be such that N < 
m < M. Assume that <!> = S F*{M, N) is such that (pk ^ Pi when k ^ i. 

Then the following statements are equivalent: 

(i) <!> is m—scalable, respectively, strictly m—scalable, 

(ii) There exists a subset I C [M] with ffl = m such that 0 € co{F{^j)), 
respectively, 0 G ri co(F(^/)). 

(iii) There exists a subset I C [M] with ffl = m for which there is no h G 
with {F{(pk),h) > 0 for all k G I, respectively, with {F{(pk),h) > 0 for all 
k G I, with at least one of the inequalities being strict. 


The details of the proof of this result can be found in [53l Theorem 3.9]. We point 
out however, that the equivalence of (i) and (ii) follows from considering co(F($/)) 
which is the polytope in generated by the vectors {F{(pk)}kGi- 

By removing the normalization condition that the norm of the weights making 
a frame scalable is unity, Theorem 2.9 can be stated in terms of the polyhedral cone 
C{F{^)) generated by F{^). This is the content of the following result which was 
proved in [53l Corollary 3.14]: 


Corollary 2.10. [S31 Corollary 3.14] Let $ = {pk}^=\ € £F*, and let N < m < M 
be fixed. Then the following conditions are equivalent: 

(i) 4) is strictly m-scalable . 

(ii) There exists I C [M] with ffl = m such that C(F{^j)) is not pointed. 

(iii) There exists I C [M] with ffl = m such that C(F(4>/))° is not blunt. 

(iv) There exists I C [M] with ffl = m such that the interior of C'(F(4>/))° is 

empty. 


The map F given in (2.8) is related to the diagram vector of [41], and was 
used in [25j to give a different and equivalent characterization of scalable frames 
which we now present. We start by presenting an interesting necessary condition 
for scalability in both and proved in [^ Theorem 3.1]: 


theorem 2.11. [251 Theorem 3.1] Let^ — {pk}kLi ^ d^u{M,N). //<!> G SC{M,N), 
then there is no unit vector u G such that |(?i, pk)\ > for all k = 1, 2,..., M 
and \ {u,pk)\ > for at least one k. 


As pointed out in [25| the condition in Theorem 2.11 is also sufficient only when 
N = 2. We wish to compare this result to the following theorem that give a 
necessary and a (different) sufficient condition for scalability in R^, and these two 
conditions are necessary and sufficient only for N = 2. 


theorem 2.12. [22| Theorem 4.1] Let $ G Fu(M, N). Then the following hold: 
(a) {A necessary condition for scalability) If ^ is scalable, then 


min max | (d, pk) | > 
||d ||2 = l k 


1 

Vn' 


(2.9) 





PRECONDITIONING TECHNIQUES IN ERAME THEORY AND PROBABILISTIC FRAMES17 


(b) 

( 2 . 10 ) 


{A sufficient condition for scalability) If 


min max |(d, > 

I<i|l2 = l fc 


1 

N 


then $ is scalable. 


Clearly when N = 2 the right hand sides of both (2.9) and (2.10) 
to a necessary and sufficient condition. 

Observe that (2.9) is equivalent to the fact that for each unit 
there exists k = 1, 2 ,..., M such that 


coincide leading 
vector d S , 


which is different from the condition in theorem 12.111 

We can now present the characterization of scalable obtained in Theorem 
3.2] and which is based on the Gramian of the diagram vectors. More precisely, for 
each V € we define the diagram vector to be the vector v G given by 


( 2 . 11 ) 


i;(l)^ — v(2)^ 



v(N - 1)^ - v(N)^ 
\/mv(l)v(2) 


VIlVv(N - l)v(N\ 


where the difference of the squares v^(i) — v^(j) and the product v{i)v{j) occur 
exactly once for z < j, * = 1, 2,..., iV — 1. Using this notion, the following result 
was proved: 


theorem 2.13. [5SJ Theorem 3.2] Let $ = G be a frame of unit- 

norm vectors, and G = f>e)) be the Gramian of the diagram vectors 

Suppose that G is not invertible. Let {vi,V 2 , ■ • ■ ,u^} be a basis of the nullspace of 
G and set 

vi{i) 

ve{i), 

for z = 1, 2,..., M. Then $ is scalable if and only if 0 ^ co{ri, r 2 ,..., tm}- 


When a frame <i> = C R^ is scalable, then there exist C [0,oo) 

such that {ck‘Pk}^=i is a tight frame. The nonnegative vector uj = is 

called a scaling of $ [14]. The scaling uj = C [0, oo) is said to be a 

minimal scaling if {ipk • > 0} has no proper subset which is scalable. The notion 

of minimal scalings has recently found some very interesting applications on some 
structural decomposition of frames; see, im Section 4] for more details. It turns out 
that finding the scalings of a scalable frame can be reduced to finding its minimal 
scalings. More specifically, the following result was proved in m theorem 3.5]: 

theorem 2.14. [TU Theorem 3.5] Suppose $ = {'Pk}^=i G iFu is a scalable frame, 
and letui = {uJk}^i C [0,oo) be one of its minimal scalings. Then {(pkTk • > 0} 
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is linearly independent. Furthermore, every scaling o/ $ is a convex combination 
of minimal scalings. 


2.3. A geometric condition for scalability. The two dimensional case we ex¬ 
amined earlier (Example |2.5| and Example |2.6[ ) indicates that a frame is not scalable 
when the frame vectors “cluster” in certain “small” plane regions. In fact, broadly 
speaking, the frame is not scalable if its vectors lies in a double cone C n (—C) with 
a “small” aperture. This was formalized in theorem |2.9| and Corollary |2.10| We 
can further exploit these results to give a more formal geometric characterization 

of scalable frames. _ 

To begin, we rewrite (iii) of Theorem 2.9 in the following form. For x = {xk)k=i G 


and h = {hk)k=i G we have that 


( 2 . 12 ) 


N 


(Fix), h) = ^ hi_i{xi -xj)+ ^ 
1^2 


N -1 N 


hh(N-l-{k-l)/2)+i-lXkX(.. 

k=l i=k+l 


Consequently, fixing h {F{x), h) is a homogeneous polynomial of degree 2 in 

Xi,X 2 ,. ■. ,xm■ Denote the set of all polynomials of this form by P 2 ■ Then P 2 
can be identified with the subspace of real symmetric N x N matrices whose trace 


is 0. Indeed, for each N >2, and each p G P^ , 


N 


N -1 




N 


- xl) -b E z. ak(N-{k+l)/2)+l-lXkX(., 

k=l i=k+l 

we have p{x) = {QpX,x), where Qp is the symmetric N x A-matrix with entries 


e=2 


N -1 

Qp{^^ 1) “ 'y ^ ®fc> 

fc=i 


Qp{e,i) = -ae-i for ^ = 2, 3,..., A 


and 


Qpik, F) = -ak(^N-(k+i)/2)+i-i for /c = 1,..., A - 1, E = k + l,...,N. 


Thus, {F{x),h) = {QhX, x) = 0 defines a quadratic surface in , and condition 


(iii) in Theorem 2.9 stipulates that for $ to be scalable, one cannot find such a 


quadratic surface such that the frame vectors (with index in I) all lie on (only) 
“one side” of this surface. By taking the contrapositive statement we arrived at 
the following result that was proved differently in [521 Theorem 3.6]. In particular, 
it provides a characterization of non-scalability of finite frames, and we shall use it 
to give a very interesting geometric condition on the frame vectors for non-scalable 
frames. 


theorem 2.15. [521 Theorem 3.6] Let d> = G F* ■ Then the following 

statements are equivalent. 

(i) d) is not scalable. 

(ii) There exists a symmetric matrix Y G with Tr{Y) < 0 such that 

(Yifkjifk) > 0 for all k = 1 ,..., M . 

(iii) There exists a symmetric matrix Y G R^^^ with Tr{Y) = 0 such that 
(Yipk, (pk) > 0 for all k = 1,..., M. 


To derive the geometric condition for non-scalability we need some set up. It is 


not difficult to see that each symmetric A x A matrix Y in (iii) of theorem 2.15 
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corresponds to a quadratic surface. We call this surface a conical zero-trace quadric. 
The exact definition of such quadratic surface is 


Definition 2.16. [521 Definition 3.4] Let the class of conical zero-trace quadrics 
Cn be defined as the family of sets 


(2.13) 


X e 




N-1 'j 

: ^ ak{x,ekf = \ 

k=\ ) 


where {ek}k=i runs through all orthonormal bases of and {ak)k=i runs through 


all tuples of elements in K \ { 0 } with = 1 - 


We define the interior of the conical zero-trace quadric in (2.131, by 
< X € ^ afc(x,efc)^ < (x,e7v)^ > , 


/c=l 


and the exterior oi the conical zero-trace quadric in (2.13) by 


N-l 

X € ^ afc(x,efc)^ > (x,e7v)^ 

k^l 

It is then easy to see that Theorem |2.15| is equivalent to the following result 
established in [52 theorem 3.6] 

theorem 2.17. [521 Theorem 3.6] Let 4) C \ {0} be a frame for . Then the 
following conditions are equivalent. 

(i) 4> is not scalable. 

(ii) All frame vectors of $ are contained in the interior of a conical zero-trace 
quadric ofCN. 

(hi) All frame vectors of $ are contained in the exterior of a conical zero-trace 
quadric of Cm. 

The geometric meaning of this result is best illustrated by considering frames in 
and in which case the sets Cm for N = 2,3, have very simple descriptions 
given in [55] • For our purposes here it suffices to say that each set in C 2 is the 
boundary surface of a quadrant cone in i.e., the union of two orthogonal one¬ 
dimensional subspaces (lines through the origin) in The sets in C 3 are the 
boundary surfaces of a particular class of elliptical cones in We give examples 
of sets in Cat iV = 2, 3 in Figure (a) and (b). 

We can now state the following corollary that give a clear geometric insight into 
the set of non-scalable frames. In particular, the frame vectors cannot lie in a 
“small cone”. 

Corollary 2.18. [52 Corollary 3.8] 

(i) A frame $ C \ {0} for is not scalable if and only if there exists an 
open quadrant cone which contains all frame vectors of $. 

(ii) A frame 4) C \ {0} for is not scalable if and only if all frame vectors 
of 4> are contained in the interior of an elliptical conical surface with vertex 
0 and intersecting the corners of a rotated unit cube. 
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(a) (b) 

Figure 9. (a) shows a sample region of vectors of a non-scalable 
frame in (b) shows example of and which determine 
sample regions in 


2.4. Scalable frames and Fritz John theorem. The last characterization of 
scalable frames we should discuss is based on Fritz John’s ellipsoid theorem. Before 
we state this theorem, we recall from Section |2.2| that given a set of points V = 
{yfc}fe=i C Py is the polytope generated by Y. 

Given a.n N x N positive definite matrix X and a point c G , we define an 
TV-dimensional ellipsoid centered at c as 

E{X, c)=c + X-^/^{B) = {v : {X{v - c), (v - c)} < 1}, 

where B is the unit ball in We recall that the volume of the ellipsoid is given 
by Volume(£’(Jf, c)) = (det(Jf))“^/^wjv, where cujv is the volume of the unit ball in 

A convex body K C is a nonempty compact convex subset of IR^. It is 
well-known that for any convex body K with nonempty interior in there is a 
unique ellipsoid of minimal volume containing K; e.g., see [7^ Chapter 3]. We 
refer to nisiiiniiisiEi] for more on these extremal ellipsoids. Fritz John ellipsoid 
theorem gives a description of this ellipsoid. More specifically: 

theorem 2.19. [151 Section 4] Let K C B — B^{0, 1) (unit ball in IR-^^ be a convex 
body with nonempty interior. Then B is the ellipsoid of minimal volume containing 
K if and only if there exist ^ cc) C , m > A^-|-l 

such that 

(i) Tlk=i ^kUk = 0 

(ii) x = jy((=i^k{x,Uk)uk,yxeR^ 

where dK is the boundary of K and is the unit sphere in IR'^. In particular, 

the points Uk are contact points of K and S^~^. 


Observe that (ii) of Theorem 2.19 can be written as 

m 

X = y^fx, \/AfcMfc)\/AfcUfc iorall x 


t>N 


k=l 


which is equivalently to saying that the vectors {uk}^^i form a scalable frame. 
The difficulty in applying this theorem lies in the fact that determining the contact 
points Uk and the “multipliers” Xk is an extremely difficult problem. Nonetheless, 
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we can apply this result to our question since we consider the convex body generated 
by the frame vectors, in which case these contact points are subset of the frame 
vectors. In particular, to apply the Fritz John theorem to the scalability problem, 
we consider a frame d) G of consisting of unit norm vectors. We 

define the associated symmetrized frame as 

and we denote the ellipsoid of minimal volume circumscribing the convex hull of 
the symmetrized frame d>syi„ by and refer to it as the minimal ellipsoid of $. 
Its ‘normalized’ volume is dehned by 


K, := MM. 

UJn 

Clearly, < 1, and it is shown in [5^ theorem 2.11] that equality holds if and 
only if the frame is scalable. That is, we have 

theorem 2.20. [32 Theorem 2.11] A frame d* G iFu{M,N) is sealable if and only 
if its minimal ellipsoid is the N-dimensional unit ball, in which case = 1. 

remark 2.21. Given a unit-norm frame d>, the number defined above is one of 
a few measures of scalability introduced in [22] . These are numbers that measure 
how close to being scalable a frame $ is. For example, if for a given d>, < 1, 

then the farther away from 1 it is, the less scalable is d>. Thus V$ along with these 
other measure of scalability can be used to define “almost” scalable frames. We 
refer to [22] for details. 

Using the geometric characterization of scalable frames by one can define the 
following equivalence relation on N): d*, d> S N) are equivalent if and 

only if Uj = Uj,. We denote each equivalence class by the unique volume for all 
its members. Specifically, for any 0 < a < 1, the class P[M,N,a] consists of all 
d> G Tu{AI,N) with = a. Then, SC{M,N) = P[M,N,1\. This also allows a 
parametrization of Pu{M, N): 

PuiM,N) = u. P[M, N, o]. 

aG(0,l] 


3. Probabilistic frames 


Definition o introduces frames from a linear algebra perspective through their 
spanning properties. However, frames can also be viewed as point masses dis¬ 
tributed in In this section we survey a measure theoretical, or more precisely 
a probabilistic, description of frames. In particular, in Section |3.1| we define prob¬ 
abilistic frames and collect some of their elementary properties. In Section |T2] We 
define the probabilistic f rame potential and investigate its minimizers. We general¬ 
ize this notion in Section 3.3 to the concept of frame potentials and discuss their 
minimizers. Probabilistic analogs of these potentials are considered in Section |3^ 
This section can be considered as a companion to [32] where many of the results 
we stated below first appeared. 
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3.1. Definition and elementary properties. Before defining probabilistic frames 
we first collect some definitions needed in the sequel. 

Let V := denote the collection of probability measures on with 

respect to the Borel cr-algebra B. Let 

1^2 := 7^2(K^) = \lJL&V ■. M|(/r) := [ ||a;|pd/i(x) < oo 

I Jr" 

be the set of all probability measures with finite second moments. Given /i, € 

V 2 , let r(/r, be the set of all Borel probability measures 7 on x whose 
marginals are /i and i', respectively, i.e., ^{A x = /i(A) and 7 (IR'^ x B) = v{B) 
for all Borel subset A, B in . The space V 2 is equipped with the 2- Wasserstein 
metric given by 


(3.1) 


W|(/i, v) := min 


JR"xR" 


\\x - yfd'y{x,y),j G 


It is known that the minimum defined by (3.1 1 is attained at a measure 70 € r(/i, ix), 
that is: 




(fj.,iy) = ||x - yfd'jo{x,y). 

Jr"xR" 


We refer to O Chapter 7], and [TH Chapter 6 ] for more details on the Wasserstein 
spaces. 


Definition 3.1. A Borel probability measure fj, G V is a. probabilistic frame if there 
exist 0 < A < i? < 00 such that for all x £ we have 


(3.2) A\\xr<[ \{x,y)\^df,{y)<B\\xr. 

Jr" 

The constants A and B are called lower and upper probabilistic frame bounds, 
respectively. When A = B, p, is called a tight probabilistic frame. 


It follows from Definition 3.1 that the upper inequality in (3.2) holds if and 


only A p G 7 ^ 2 - With a little more work one shows that the lower inequality holds 
whenever the linear span of the support of the probability measure p is 


Assume that pis a tight probabilistic frame, in which case equality holds in (3.2). 
Hence, choosing x = e^ where is the standard orthonormal basis for M." 

leads to 


A\\ekf=A=f {ek,yfdp{y). 


Therefore, 


N 


N 


na = Y,a\\ 


efell = 


'^{ek,yfdp{y) = / \\y\\^dpiy) = M 2 {p) 








Consequently, for a tight probabilistic frame p, A = . 

These observations can are summarized in the following result whose proof can 
be found in [32] 


theorem 3.2. |32[ Theorem 12.1] A Borel probability measure p G V is a prob¬ 
abilistic frame if and only if p G V 2 and , where denotes the linear 







PRECONDITIONING TECHNIQUES IN ERAME THEORY AND PROBABILISTIC FRAMES23 


span o/supp(/i) inM.^. Moreover, if is a tight probabilistic frame, then the frame 
bound is given by 

We now consider some examples of probabilistic frames. 

Example 3.3. (a) A set $ = C is a frame if and only if the prob¬ 

ability measure pii^ = ^ ^vk supported by the set $ is a probabilistic 
frame, where 6^^ denotes the Dirac measure supported at ip G . 

(b) More generally, let a = {ak}kLi C (0, oo) with Ofe = 1- A set $ = 

{ipk}^=i C is a frame if and only if the probability measure = 
'n!k=i o-kbipk supported by the set <1) is a probabilistic frame. 

(c) By symmetry consideration one also shows that the uniform distribution on 

the unit sphere in R^ is a tight probabilistic frame [301 Proposition 

3.13]. That is, denoting the probability measure on by da we have 

that for all x G R^ , 

^=/ {x,yfda{y). 

In the framework of the Wasserstein metric, many properties of probabilistic can 
be proved. For example, if we denote by P{A, B) the set of probabilistic frames 
with frame bounds 0 < A < S < oo, then the following result was proved in |H31 
Proposition 1]: 

proposition 3.4. [831 Proposition 1] P{A,B) is a nonempty, convex, closed subset 

of iP 2 (K^). 


Other results including a probabilistic treatment of the frame scalability prob¬ 
lem also appeared in [53]. Furthermore, in [53] an optimal transport approach to 
minimizing a frame potential that generalizes the Benedetto and Fickus potential 
was developed. In the process, the smoothness (in the Wasserstein metric) of this 
potential was derived. 

Probabilistic frames can be analyzed in terms of a corresponding analysis op¬ 
erator and its adjoint the synthesis operator. Indeed, let /i S 7^ be a probability 
measure. The probabilistic analysis operator is given by 

T^:R^ ^ L^{R^,Pl), x^{x,-). 

Its adjoint operator is defined by 

t; ■.l^{r^,p)^r^, f^ [ f{x)xdp{x) 

and is called the probabilistic synthesis operator, where the above integral is vector¬ 
valued. The probabilistic frame operator of p is 

Q rri* rp 


and one easily verifies that 

Sf,:R^^R^, Sf_,{x) = [ {x,y)ydfi{y). 

jR" 
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If is the canonical orthonormal basis for then 


N 

Sfj.ei = y^mij{fi)ej, 

i=i 


where 

mij = f 

JR« 

is the {i,j) entry of the matrix of second moments of fi. Thus, the probabilistic 
frame operator is the matrix of second moments of y. Consequently, the following 
results proved in [32] follows. 


proposition 3.5. [321 Proposition 12.4] Let y G V, then 5'^ is well-defined (and 
hence bounded) if and only if 

M2{y) < oo. 

Furthermore, p is a probabilistic frame if and only if is positive definite. 


If /i is a probabilistic frame then is invertible. Let p be the push-forward of 
p through S~^ given by 

p = poS^. 

In particular, given any Borel set B C we have 

p{B) = p{{S-^)-^B)=p{S^B). 

Equivalently, p can be defined via integration. Indeed, if / is a continuous bounded 
function on , 

f f{y)dp{y) = [ f{S~^y)dp{y). 

Jr" Jr" 

In fact, p is also a probabilistic frame (with bounds lfB< 1/A) called the proba¬ 
bilistic canonical dual frame of p. Similarly, when p is a probabilistic frame, 5^ is 

_ /2 

positive definite, and its square root exists. The push-forward of p through 
is given by 

p\B) = p{S^I’^B) 

for each Borel set in The properties of these probability measures are summa¬ 
rized in the following result. We refer to [321 Proposition 12.4] and [321 Proposition 
12.5] for details. 


proposition 3.6. Let p G V he a probabilistic frame with bounds 0 < A < B < oo. 
Then: 

(a) p is a probabilistic frame with frame bounds 1/B < 1/A. 

(b) p^ is a tight probabilistic frame. 

Consequently, for each x G we have: 

(3.3) f {x,y) Sfj,ydp{y) = f {S~^x,y) y dp{y) = S^S~^{x) = x, 

Jr" Jr" 


[ {x,y)ydp^{y) = [ {S^^/^x,y) S^^/^ydp{y) 
Jr" Jr" 




and 

(3.4) 
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It is worth noticing that (3.3) is the analog of the frame reconstruction for¬ 
mula O) while ( |3.4[ ) is analog of ( |1.2[ ). 

In the context of probabilistic frames, the probabilistic Gram operator, or the 
probabilistic Gramian of p, is the compact integral operator defined on 

by 

G,.f{x) =T^T*f{x) = [ K{x,y)f{y)dp{y) = [ {x,y) f{y)dp{y). 

JR" JR" 


It is immediately seen that is an integral operator with kernel given by K{x, y) = 
{x,y), which is continuous and in xK^,/i(g)/i) C xM.^, p), where 

/i 0 /i is the product measure of p with itself. Consequently, is a trace class and 
Hilbert-Schmidt operator. Moreover, for any / G L^(R^,p), G^f is a uniformly 
continuous function on . As well-known, and 5^ have a common spectrum 
except for the 0. In fact, in the next proposition we collect the properties of 


proposition 3.7. [32l Proposition 12.4] Let p G V then is a trace class and 
Hilbert-Schmidt operator on The eigenspace corresponding to the eigen¬ 

value 0 has infinite dimension and consists of all functions 0 ^ / G L^(K^,/i) such 


that 



yf{y)dp{y) = o. 


While new finite frames can be generated from old ones via (linear) algebraic 
operations, the setting of probabilistic frames allows one to use analytical tools to 
construct new probability frames from old ones. For example, it was shown in |32j 
when the convolution of a probabilistic frame and a probability measure yields a 
probabilistic frames. The following is a summary of some of the results proved in 

m- 


proposition 3.8. [32J Theorem 2 & Proposition 2] The following statements hold: 

(a) Let p G V 2 he a, probabilistic frame and let v G V 2 - If supp(/i) contains at 
least -I- 1 distinct vectors, then p* 1 / is a probabilistic frame. 

(b) Let p and u be tight probabilistic frames. If ydv{y) = 0, then p * v is 
also a tight probabilistic frame. 


3.2. Probabilistic frame potential. One of the motivations of probabilistic frames 
lies in Benedetto and Fickus’s characterization of the FUNTFs as the minimizers 
of the frame potential (1.4). In describing their results, they motivated it from a 
physical point of view drawing a parallel to Coulomb’s law. It was then clear that 
the notion of frame potential carries significant information about frames, and can 
be viewed as describing the interaction of the frame vectors under some “physical 
force.” This in turn partially motivated the introduction of a probabilistic analog 
to the frame potential in |29| . Furthermore, the probabilistic frame potential that 
we introduce below, can be viewed in the framework of other potential functions, 
e.g., those investigated by Bjorck in [10]. In this section we review the properties 
of the probabilistic frame potential investigating in particular its minimizers. The 
framework of the Wasserstein metric space (P 2 , IF 2 ) also offers the ideal setting to 
investigate this potential and certain of its generalizations. While we should not 
report of this analysis here, we shall nevertheless introduce certain generalizations 
of the probabilistic frame potential whose minimizers are better understood in the 
context of the Wasserstein metric spaces. 

But we first start with the definition of the probabilistic frame potential. 
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Definition 3.9. The probabilistic frame potential is the nonnegative function de¬ 
fined on V and given by 


(3.5) 


PFP(m) = // \{x,y)\'^ dp,{x)dp.{y), 


for each pi GV. 


The following proposition is an immediate consequence of the above definition: 

proposition 3.10. Let p, G V, then PFP(/x) is the Hilbert-Schmidt norm of the 
probabilistic Gramian operator that is 


\\GXhs=II {x,yydp{x)dpiy). 

Furthermore, if /i S 7^2, (which is the case when ^ is a probabilistic frame) then 
we have 

PFP{p) <M^{p)<oo. 

We recall from Definition |3.1| that p is a tight probabilistic frame if 
/ {x,yfdp{y) = ^^^\\x\\^ 

for all X G Integrating this equation with respect to x leads to 

[[ {x,yydp{x)dp{y) =PFP{p) = 

J JR"xR" 

ft turns out that this value is the absolute lower bound to the probabilistic frame 
potential. 

theorem 3.11. |321 Theorem 3] Let p G 1^2 be such that M 2 {p) = 1 and set 
Efi ~ span{swpp{p)), then the following estimate holds 

(3.6) PFP(^) > 1/n 

where n is the number of nonzero eigenvalues of . Moreover, equality holds if 
and only if p is a tight probabilistic frame for E^. 

In particular, given any probabilistic frame p GV 2 with M 2 {p) = 1, we have 

PFP{p) > 1/N 

and equality holds if and only if p is a tight probabilistic frame. 

The proof of this result can be found in |32L Theorem 3]. Recently, a very simple 
and elementary proof of the last part of the result was given in [?, Theorem 5]. 
Furthermore, in [82j an optimal transport approach to minimizing a modification 
of the probabilistic frame potential was considered and showed great promise to 
analyze other potential functions in frame theory. Moreover, this approach has a 
natural numerical part that could be used as a gradient descent-type method to 
numerically find the minimizers of the PFP and its generalization. 
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3.3. The frame potentials. The techniques used to prove Theorem 3.11 can 
be used to investigate the minimizers of other related potential functions, especially 
when there are defined for probability measures supported on compact sets, such as 
on the unit sphere. In this section, we define a family of (deterministic) potentials 
and describe their minimizers. The probabilistic analogs of these results will follow 
in the next section. 

To motivate our definition, we recall the following result due to Strohmer and 
Heath m, and we refer to [ST] for historical perspectives on this result. 

theorem 3.12. [Til Theorem 2.3] For any frame $ = C with ||(/5fe|| = 

1, we have 


(3.7) 


max|(v5fc,7>^)| > 


M-N 


k^i .- V 

and equality hold if and only if ^ is a FUNTF such that 

M-N 


\{V’k,w)\ = f] 


(3-8) II — y Ar(M-l) 

when k ^ i. Furthermore, equality can hold only when M < 


A FUNTF that satisfies (3.81 is termed an equiangular tight frame (ETF). Note 


that the left-hand side of (3.7) can be viewed as a potential function of $. Indeed, 


this is the so-called coherence of $ that we shall defined for reasons to be evident 
later as 


(3.9) 


FPoo.m(^') = max|((/jfc,'^f)| 


for $ = Wk}k^i C In fact, FPoo,Ar(^) as well as the frame potential FP 


given in (1.4) are members of the family of the p*" frame potentials defined by: 


Definition 3.13. Let M be a positive integer, and 0 < p < c». Given a collection 
of unit vectors $ = {‘Pk}^=i C S^~^, the p-frame potential is the functional 


FP 


(3.10) 

When, p = oo, the definition reduces to 


M 


FPoo.m(^’) = max|(pfe,p^)|. 
k^^ 

It is clear that FPp_M and its minimizers are also functions of N, the dimension 
of the underlying space. However, to keep the notations simple, we shall not make 
explicit the dependence on N, unless it is necessary. 

The case p = 2 corresponds to the frame potential FP given in (1.4). As men¬ 
tioned above, FPoo ^(‘b) is the coherence of $ and plays a key role in compressed 
sensing [IIITIIMIIMIITI]. Moreover, for fixed M, the minimizers of FPqo.m are 
called Grassmanian frames [ilZI]. By using continuity and compactness arguments 
one can show that given M, N, FPoo_m always has a minimum [S] Appendix]. The 
challenge is the construction of these Grassmanian. In [8] constructions of Grassma- 
nians were considered for N = 2 and M >2, and for A^ = 3 when M G {3,4, 5, 6}. 
The ideas used in these constructions are based on analytical interpretation of 
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some geometric results obtained in m- The general question of constructing the 
minimizers of FPqo ^ for iV > 3 and M > 6 is still a mostly open question. 

Even more, minimizing is an extremely difficult problem as one needs to 

deal with both p, M, and the ambient dimension N. Some results on the minimizers 
as well as the value of the minimum as a function of the involved parameters were 
proved in [29]. We refer to [62| for earlier results on minimizing the frame 
potential. Before summarizing some of these results we consider the special case 
where M = 3, N = 2 and seek the minimizers of 

3 

FPp.3($)= E 

when p G ( 0 , oo) with the usual modification when p = oo, and $ — {(pk}k^i C S^. 
When p = 2^ 

3 

FP2.3($)= ^ \{^k,V^e)f>9/2 

k,e=i 

with equality if and only if $ = {<Pfc}fc=i C 5^ is a FUNTF. A minimizer of FP 2,3 
is the MB-frame which is pictured below: 


Ivl-B Frame 



Figure 10. An example of Equiangular FUNTF: the MB-frame. 

When p = oo, 

FPoo, 3($) = max|((pfc,(p£)| > 1 /a/2 
k^£, 

with equality if and only if d) = {(pk}k=i C 5*^ is an ETF. But what happens for 
other values of 2 7 ^ p S (0,oo)? This was partially answered in [21] for 0 < p < 2 
and recently the case p > 2 was settled [5S]. Before giving more details on this 
case, we first collect a number of generic results about the minimizers of FP^^m 
when M > N >2 and p S (0, 00 ]. 

proposition 3.14. Let p € (0,oo], M,N be positive integers. Let $ = {(pk}kLi C 
5 '^-! we have: 
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(a) If M > TV and 2 < p < oo, then 

FPp,m(<J>) > M(M - + N, 

and equality holds if and only if $ is an ETF. 

(b) Let 0 < p < 2 and assume that M = kN for some positive integer k. Then 
the minimizers of the p-frame potential are exactly the k copies of any 
orthonormal basis modulo multiplications by ±1. The minimum of (3.10) 
over all sets of M = kN unit norm vectors is k'^N. 


(c) Assume that M = N + 1 and set po = 


Assume 


log(Ar) “ log(M-l) 

that FPpj,_Ar($) > A^ + 3, with equality holding if and only if d) = {^k}k=i 
is an orthonormal basis plus one repeated vector or an equiangular FUNTF. 
Then, 

(1) for 0 < p < Po, for any C S^~^, we have FPp_jv+i(iI>) > 

TV+3, and equality holds if and only if $ = {pk}k=i is an orthonormal 
basis plus one repeated vector, 

(2) for Po < p < 2, for any C S^~^, we have FPp_jv+i($) > 

2™ ((A^ + + 1, and equality holds if and only if $ = 

Wk}k=i is an equiangular FUNTF. 


In the special case when N = 2, part (c) of the proposition becomes: 

Corollary 3.15. For A^ = 2, M = 3, and po = the hypothesis of (c) above 

holds. That is, for any $ = {pk}%=i C 5'^, 

FPpc.3($) > 5, 

and equality holds if and only if $ = is an orthonormal basis plus one 

repeated vector or an equiangular FUNTF. 


However, when > 3, it is still unknown if the hypothesis of proposition 3.14 

(c) holds, and it was conjectured in in [29] that with po given in (c), 

FPp„,^+i($)> A + 3 

with equality if and only if $ = {pk}k=i is an orthonormal basis plus one repeated 
vector or an equiangular FUNTF. 

Using Corollary |3.15| one can compute 

Mp,3.2 = min{FPp,2(4>) : $ = {<fk}l=i C 5^} 

for all p G (0, oo] leading to 


forp e (0, 

6 e-piog 2 forp>gg. 

The graph of /ip, 3,2 when p G (0, 00 ) is given in Figure 


A^P.3,2 — 


11 


One can ask about of /ip,M ,2 for other values of M. It follows from proposition 3.14 
(b) that /ip,M.2 = ‘2k"^ for all p G (0, 2] whenever M = 2k is an even integer. For 
p > 2 or odd M some numerical simulations were considered in |85j . For example, 
the following graphs (Figures [I^ and 13) of /ip,M.2 for M G {4,6} were obtained. 

For M = 5 the numerical results suggest that the graph of /ip, 5,2 is as given 


14 Finally, we plot the behavior of /(p,M ,2 as a sequence in M when p G (0,4) is 


shown in Figure [T^ 
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Frame Potential (M = 3) 



Figure 11. Graph of when p € (0,4). 


Frame Potential (M = 4) 



Figure 12. Graph of when p £ (0,4). 


For integer values of p, the minimizers of FP^^m have been investigated in con¬ 
nection with the theory of spherical designs [261 [75]. 
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Frame Potential (M = 6) 



Figure 13. Graph of when p € (0,4). 


Frame Potential (M = 5) 



Figure 14. Graph of /ip, 5,2 when p € (0,4). 

Definition 3.16. Let t be a positive integer. A spherical t-design is a finite subset 
{xi}iLi of the unit sphere in such that, 

1 “ f 

— ^ h{xi) = h{x)da{x), 
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Frame Potentials 



P 


Figure 15. Graph of fJ.p,M ,2 when p s (0,4), M G {3,4,5,6}. 


for all homogeneous polynomials h of total degree equals or less than tin N variables 
and where a denotes the uniform surface measure on normalized to have mass 

one. 


It is easy to see that any spherical t—design is also a spherical t'—design for all 
positive integers t' < t. Spherical 2—designs are exactly FUNTFs whose center of 
mass is at the origin. More precisely we have: 

proposition 3.17. $ = C is a spherical 2-design if and only if $ is a 

FUNTF and Y.k=i = 0. 

We refer to Eg, E3 Theorem 3.2] for details on the proof of the above propo¬ 
sition. Recalling that FUNTFs minimize the frame potential, it is not surprising 
that spherical t-designs also minimize a potential. In particular, 


theorem 3.18. [751 Theorem 8.1] Let p = 2k be an even integer and = 

C S^~^, then 


FPp.M({a:i}“i) > 


1 • 3 • 5 • • • (p - 1) 
iV(iV-F2)---(iV-Fp-2) 




and equality holds if and only if {xi}f'Li is a spherical p-design. 


3.4. Probabilistic p frame potential. The frame potential can be viewed in 
light of mass distributions on the unit sphere. It is therefore natural to look at it 
from a probabilistic point of view. This motivates the the introduction of the larger 
family of potential called probabilistic p-frame potential. 

For p G (0, oo) set 


Vp = {peV : MP{p) = [ \\y\\Pdp{y) < oo}. 
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Definition 3.19. For each p G (0, oo), the probabilistic p—frame potential is given 

by 


(3.11) 


PFP{fj.,p) 


R«xR~ 


\{x,y)\P dp{x)dp{y). 


When /r is a purely atomic measure with atoms on the unit sphere, that is when 
supp(/r) = <i) = C S^~^, PFP(/i,p) reduces to FPp_M given in (3.10). 


This class of potentials is related to the potentials considered by G. Bjfock m- 
More precisely, suppose F C is compact and let A > 0. Bjorck considered the 
question of maximizing the functional 


Ipiy) = 


FxF 


\x - y\\^dp{x)dp{y) 


where p ranges over all positive Borel measures with p{F) = 1. It turns out that 
the techniques used in m to maximize If{p) can be extended to understand the 
minimizers of PFP(/r) when p is restricted to a probability measure on the unit 
sphere in In particular, it was proved in [53 theorem 4.9] that when 

restricted to probability measures p supported on the unit sphere of and when 
0 < p < 2, then the minimizers of PFP(p,p) are discrete probability measures. 
Furthermore, the support of such minimizers contains an orthonormal basis B and 
is contained in the set FB. More specifically we have: 


theorem 3.20. [29l Theorem 4.9] Let 0 < p < 2, then the minimizers of (3.11) 
over all the probability measures supported on the unit sphere are exactly 

those probability measures p that satisfy 

(i) there is an orthonormal basis {ei,..., Cn} for such that 


{ei,..., cn} C supp(p) C {±ei,..., ie^v} 

(ii) there is f : —>■ K such that p{x) = f{x)i'±xi,...,±XKix) and 

f{x^) + f{-Xi) = 

where the measure v±xi,...,±XNix) represent the counting measure of the set 
{±Xi : I = 1,... ,iV}. 


Theorem 3.18 shows that the minimizers of FPp^M when p = 2k is an even inte¬ 
ger, are exactly spherical p—designs. In view of this fact, one can ask whether the 
minimizers of PFP have some special “approximation” properties. This partially 
motivates that following definition in which we denote by A4{S^~^, B) the space 
of all Borel probability measures supported on S^~^. 


Definition 3.21. [53 Definition 4.1] For 0 < p < oo, we call p G Ai{S^ ^,B) a 
probabilistie p-frame for if and only if there are constants A, B > 0 such that 

(3.12) ^llpir < / \{x,y)\Pdp{x) < B\\y\\P, Vp G 

We call p a tight probabilistic p-frame if and only if we can choose A = B. 


By symmetry considerations, it is not difficult to show that the uniform surface 
measure cr on is always a tight probabilistic p-frame, for each 0 < p < oo. 

In addition, observe that we can always take B = 1 in (3.12). Thus to determine 
if a probability measure p on is a probabilistic p—frame one must focus on 
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establishing the lower bound in the above dehnition. When p = 2 this dehnition 
reduces to that of probabilistic frames introduced earlier. In fact, more is true: 

Lemma 3.22. |291 Lemma 4.5] If p is probabilistic frame, then it is a probabilistic 
p-frame for all 1 < p < oo. Conversely, if p is a probabilistic p-frame for some 
1 < p < oo, then it is a probabilistic frame. 


The analogy between tight probabilistic p-frames and spherical t—designs can 
now be made explicitly as one can show the following result which is an analog of 
theorem |3.18| More specihcally, the result below shows that tight probabilistic p- 
frames are the minimizers of the probabilistic frame potential (3.111 when restricted 
to probability measures supported on S^~^, and when p is an even integer: 


theorem 3.23. |291 Theorem 4.10] Let p be an even integer. For any probability 
measure p on , 

PFP{p,p)=j' ( \{x,y)f’dp{x)dp{y)> , 

7siv-i JsN-i N{N + 2) • • • (fV + p - 2) 

and equality holds if and only if p is a tight probabilistic p-frame. 

By combining Theorem |3.18| and Theorem |3.23| we can conclude that whenp = 2k 
there exists a one-to-one correspondence between the class of spherical p—designs 
and the class of discrete tight probabilistic p—frames. More specihcally, every 
spherical p—design supports a discrete measure p which is a tight probabilistic 
p-frame. This is summarized in the following proposition: 

proposition 3.24. Let p = 2A: be an even positive integer. A set 4> = {Fk}^^i C 
is a spherical p—design if and only if the probability measure P<i> = ^ 'l2k=i 
is a tight probabilistic p—frame. 


The question then becomes how to construct tight probabilistic p-frames. When 
restricted to discrete measures and when p = 2fc is an even integer, this problem 
is equivalent to constructing spherical p—designs. This is a difficult problem with 
known solutions only for certain values of p, M, and N. Of course, and as shown in 
Section the special case when p = 2 leads to the FUNTFs. The analytics methods 
developed in [82] are new promising techniques that could be used to investigated 
in general the minimizers of PFP(/i) when p ranges over the probability measures 
on and p > 0. 
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